Yongtao Wang

Feat2Go: Visual Feature-Grounded Value Estimation for Embodied Reinforcement Learning

May 29, 2026
Via arXiv

3DVLA: Enhancing Vision-Language-Action Models via 3D Spatial and Instance Understanding

May 28, 2026
Via arXiv

HiDrive: A Closed-Loop Benchmark for High-Level Autonomous Driving

May 11, 2026
Via arXiv

VL-SAM-v3: Memory-Guided Visual Priors for Open-World Object Detection

May 05, 2026
Via arXiv

QAPruner: Quantization-Aware Vision Token Pruning for Multimodal Large Language Models

Apr 03, 2026
Via arXiv

ELITE: Experiential Learning and Intent-Aware Transfer for Self-improving Embodied Agents

Mar 25, 2026
Via arXiv

R4Det: 4D Radar-Camera Fusion for High-Performance 3D Object Detection

Mar 12, 2026
Via arXiv

YOLO-NAS-Bench: A Surrogate Benchmark with Self-Evolving Predictors for YOLO Architecture Search

Mar 10, 2026
Via arXiv

KnowVal: A Knowledge-Augmented and Value-Guided Autonomous Driving System

Dec 23, 2025
Via arXiv

HENet++: Hybrid Encoding and Multi-task Learning for 3D Perception and End-to-end Autonomous Driving

Nov 10, 2025
Via arXiv